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1. The generic protein stiffness potential and secondary 
structure bias 

As defined above, the model chain was intrinsically very flexible. A 
substantial fraction of its conformations that were allowed due to the assumed 
simplified hard core interactions did not correspond to any real polypeptide chain 
conformation. In particular, proteins are relatively stiff polymers. Moreover, folded 
proteins have very characteristic distributions of certain short-range distances. For 
example, the bimodal distribution of the distances between the i-th and i+4 th residues 
reflects the tendency to adopt either of two types of conformations. These 
correspond to extended (P-type or extended coil) or very compact conformations (as 
within helices or turns). Such generic features need to be included in the model. 
Here, the SICHO model differs from that used in Example 1 due to the refined 
protein representation (a larger number of allowed chain vectors and a modified 
position of the center of interaction, that also included alpha carbons). 

First, for all possible two-vector sequences of the model chain, a direction w 
was defined that was almost perpendicular to the plane formed by the fragment. A 
small systematic deviation from the exactly orthogonal direction was introduced in 
w to obtain vectors that were, on average, parallel to the helix axis and which also 
accounted for the average supertwist of p-strands. 

uKv i -i®Vi-v i _ 1 -v i ) (1) 
wr=Uj/|ui| (2) 
where v* is the i-th vector (or virtual bond) of the model chain, the symbol "® M 
denotes the vector cross product and |ui| is the length of vector Uj. Consequently, 
these "directions of secondary structure" (the vectors w point along a helix or across 
a P-sheet) were normalized so that their length was equal to 1 . The idea is explained 
in Figure 12, where the model chain virtual bonds are shown in solid lines and the 
vectors are shown in open arrows. 

The stiffiiess/secondary structure bias has the following form: 
Estif^-SgentSminfO.S, max(0, w* • w^)}] (3) 
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-e ge n[£min{0.5, max(0, Wi • w^)}] 
Sgen is a constant energy parameter, common for all generic potentials, and Z means 
summation along the chain. The above formulation means that the system is 
energetically stabilized when pairs of "direction of secondary structure" vectors are 
parallel (positive dot product). As can be read from the above equation, the 
stabilization energy increased in the range between 90° and 30° (angle between 
appropriate vectors w) and then maintained its extreme value. Thus, small 
fluctuations of the secondary structure had no influence on the value of this 
potential, nor did the changes outside the regular conformations (negative values of 
the dot product) have any effect on the conformational energy. The minimum value 
of the stiffness function per residue was equal to e gen , and the maximum was 0. 

Additionally, a bias was introduced towards the specific geometry of helical 
and P-type expanded states (however, it was quite permissively defined). All 
conformations were, of course, allowed; the purpose of this bias was to mimic a 
protein-like (average) distribution of local conformations. Symbolically, this could 
be written as follows: 

Estruct = 2 {8Hl(i) + 8H2(i) + 8El(i) + 8E2(i)} (4) 

with: 

5H1 (i) = -Sgen » for t\h.4 < 36 and (v { «v M ) >0 and (v s »v M ) <-5 

0 , otherwise (4a) 

8H2(i) - -s g en , for r\ i+4 < 36 and (vj tv M ) >0 and (v i+ i <-5 

0 , otherwise (4b) 

5E1 (i) = -Sgen > for 56 < r\ i+4 < 1 35 and (v* •v i+2 ) > 5 

0 , otherwise (4c) 

5E2(i) = -s gen , for 56 < ^ < 135 and (y m •vm) > 5 

0 , otherwise (4d) 

The numerical values are in lattice units and were selected to define a broad range of 
helical/turn conformations (for the 8H1 and 8H2 contributions) or expanded 
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conformations (for the 5E1 and 8E2 contributions). Due to the exclusive character 
of the two subsets of geometrical conditions for specific chain conformations, the 
minimum contribution from a residue is equal to -2e gen (either the first two 
conditions or the two last conditions can be simultaneously satisfied). Expressed 
differently, equation (4d) indicated that the system gained an energy equal to -8 ge n 
for being in an expanded p-type conformation. For a four-vector fragment of the 
chain, this required that the distance between the i-th and i+4 beads (the centers of 
mass of the side chain plus Ca units) had to lie between 10.7 and 16.8 A, and the 
chain vectors v i+ i and v i+3 have to be oriented in a parallel-like fashion (the dot 
product >5). Additional stabilization is gained when, for the same fragment, another 
pair of vectors is parallel (see equation 4c). The broad ranges allowed for 
substantial fluctuations (without an energetic penalty) around an ideal expanded 
state and accommodated the variations of the model chain geometry caused by 
differences in side chain size. 

Computational experiments have also been performed where all interactions, 
except the ones defined above, were turned off. At low temperature, the model 
chain formed rapidly fluctuating local clusters of expanded and helix-like states. 
The persistence length and the distributions of the short-range distances along the 
chains mimicked protein-like geometry. 

2. Generic packing cooperativity 

Two terms were introduced to enforce some of the most general regularities 
of the dense packing of protein structures. 10 In all the more regular elements of 
secondary structure (within helices and p-sheets, but not between helices) and, to a 
lesser extent, in some coil-type fragments and turns, given a contact between a pair 
of reference residues, there was a very strong preference to have contacts (we 
provide precise definition of the "contact" later) between the preceding and the 
following residues. Indeed, the contact maps of globular proteins contain very 
characteristic strips. 22 Those near the diagonal corresponded to the intrahelical 
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